Search CORE

119 research outputs found

MC64: A web platform to test bioinformatics algorithms in a many-core architecture

Author: A. Driga
A.M. Waterhouse
D. Díaz
D.S. Hirschberg
J.D. Thompson
K.-B. Li
S. Altschul
S. Gálvez
S.B. Needleman
T.F. Smith
Publication venue: Springer Berlin / Heidelberg
Publication date: 01/01/2011
Field of study

New analytical methodologies, like the so-called "next-generation sequencing" (NGS), allow the sequencing of full genomes with high speed and reduced price. Yet, such technologies generate huge amounts of data that demand large raw computational power. Many-core technologies can be exploited to overcome the involved bioinformatics bottleneck. Indeed, such hardware is currently in active development. We have developed parallel bioinformatics algorithms for many-core microprocessors containing 64 cores each. Thus, the MC64 web platform allows executing high-performance alignments (Needleman-Wunsch, Smith-Waterman and ClustalW) of long sequences. The MC64 platform can be accessed via web browsers, allowing easy resource integration into third-party tools. Furthermore, the results obtained from the MC64 include time-performance statistics that can be compared with other platform

Repositorio Institucional de la Universidad de Córdoba

Crossref

Digital.CSIC

Towards Reliable Automatic Protein Structure Alignment

Author: A. Caprara
A. Zemla
A.G. Murzin
A.S. Konagurthu
C.A. Rohl
C.B. Do
G. Lancia
H.M. Berman
I.N. Shindyalov
J. Shi
J. Xu
J.F. Gibrat
K. Mizuguchi
L. Kinch
L. Xie
M. Comin
M. Levitt
M. Moakher
M. Sadowski
N.M. Daniels
N.N. Alexandrov
S. Henikoff
S. Subbiah
S.B. Needleman
S.B. Pandit
S.R. Eddy
W. Pirovano
Y. Yang
Y. Ye
Y. Zhang
Y. Zhang
Y. Zhang
Publication venue
Publication date: 01/01/2013
Field of study

A variety of methods have been proposed for structure similarity calculation, which are called structure alignment or superposition. One major shortcoming in current structure alignment algorithms is in their inherent design, which is based on local structure similarity. In this work, we propose a method to incorporate global information in obtaining optimal alignments and superpositions. Our method, when applied to optimizing the TM-score and the GDT score, produces significantly better results than current state-of-the-art protein structure alignment tools. Specifically, if the highest TM-score found by TMalign is lower than (0.6) and the highest TM-score found by one of the tested methods is higher than (0.5), there is a probability of (42%) that TMalign failed to find TM-scores higher than (0.5), while the same probability is reduced to (2%) if our method is used. This could significantly improve the accuracy of fold detection if the cutoff TM-score of (0.5) is used. In addition, existing structure alignment algorithms focus on structure similarity alone and simply ignore other important similarities, such as sequence similarity. Our approach has the capacity to incorporate multiple similarities into the scoring function. Results show that sequence similarity aids in finding high quality protein structure alignments that are more consistent with eye-examined alignments in HOMSTRAD. Even when structure similarity itself fails to find alignments with any consistency with eye-examined alignments, our method remains capable of finding alignments highly similar to, or even identical to, eye-examined alignments.Comment: Peer-reviewed and presented as part of the 13th Workshop on Algorithms in Bioinformatics (WABI2013

arXiv.org e-Print Archive

Crossref

Soft topographic map for clustering and classification of bacteria

Author: G.M. Garrity
H. Klock
I.T. Joliffe
J.D. Thompson
J.E. Clarridge III
K. Rose
M. Drancourt
M. Drancourt
M. Drancourt
M. Remm
P. Rice
S. Altschul
S. Dubnov
S. Kumar
S.B. Needleman
S.P. Luttrell
T. Graepel
T. Graepel
T. Hofmann
T. Hofmann
T. Kohonen
T. Kohonen
T.H. Jukes
W.S. Torgerson
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

In this work a new method for clustering and building a topographic representation of a bacteria taxonomy is presented. The method is based on the analysis of stable parts of the genome, the so-called “housekeeping genes”. The proposed method generates topographic maps of the bacteria taxonomy, where relations among different type strains can be visually inspected and verified. Two well known DNA alignement algorithms are applied to the genomic sequences. Topographic maps are optimized to represent the similarity among the sequences according to their evolutionary distances. The experimental analysis is carried out on 147 type strains of the Gammaprotebacteria class by means of the 16S rRNA housekeeping gene. Complete sequences of the gene have been retrieved from the NCBI public database. In the experimental tests the maps show clusters of homologous type strains and present some singular cases potentially due to incorrect classification or erroneous annotations in the database

Central Archive at the University of Reading

CiteSeerX

Crossref

Archivio istituzionale della ricerca - Università di Palermo

SimSearch: A new variant of dynamic programming based on distance series for optimal and near-optimal similarity discovery in biological sequences

Author: A. Lefebvre
B. Ma
H. Peltola
H.-N. Teodorescu
Henikoff
J.B. Kruskal
M. Li
M.V. José
O. Gotoh
P.H. Sellers
R. Kolpakov
R.E. Bellman
S.B. Needleman
S.F. Altschul
S.F. Altschul
T. Schmidt
T.F. Smith
W.R. Pearson
X. Huang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

http://www.informatik.uni-trier.de/%7Eley/db/conf/iwpacbb/iwpacbb2008.htmlIn this paper, we propose SimSearch, an algorithm implementing a new variant of dynamic programming based on distance series for optimal and near-optimal similarity discovery in biological sequences. The initial phase of SimSearch is devoted to fulfil the binary similarity matrices by signalling the distances between occurrences of the same symbol. The scoring scheme is further applied, when analysed the maximal extension of the pattern. Employing bit parallelism to analyse the global similarity matrix’s upper triangle, the new methodology searches the sequence(s) for all the exact and approximate patterns in regular or reverse order. The algorithm accepts parameterization to work with greater seeds for near-optimal results. Performance tests show significant efficiency improvement over traditional optimal methods based on dynamic programming. Comparing the new algorithm’s efficiency against heuristic based methods, equalizing the required sensitivity, the proposed algorithm remains acceptable.This work has been partially supported by PRODEP

Universidade do Minho: RepositoriUM

Crossref

Biblioteca Digital do IPB

Computer-aided design of nano-filter construction using DNA self-assembly

Author: A. Carbone
A.P. Alivisatos
B.H. Robinson
C.A. Mirkin
C.F. Monson
D.M. Hawkins
E. Braun
E.K. Freyhult
E.W. Myers
F. Yoshida
G. Braun
H. Qiu
Hassan Mohabatkar
I. Willner
J. Richter
J.D. Watson
K. Keren
K. Keren
M. Liu
M. Mertig
M.A. Batalia
N.C. Seeman
N.C. Seeman
N.C. Seeman
O. Gotoh
R.P. Fahlman
Reza Mohammadzadegan
S. Chomet
S.B. Needleman
T.F. Smith
T.G. Drummond
Z.X. Deng
Publication venue: Springer
Publication date: 01/01/2006
Field of study

Computer-aided design plays a fundamental role in both top-down and bottom-up nano-system fabrication. This paper presents a bottom-up nano-filter patterning process based on DNA self-assembly. In this study we designed a new method to construct fully designed nano-filters with the pores between 5 nm and 9 nm in diameter. Our calculations illustrated that by constructing such a nano-filter we would be able to separate many molecules

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Supporting Tabular Data Characterization in a Large Scale Data Infrastructure by Lexical Matching Techniques

Author: C. Bizer
D. Castelli
D.D. Roure
G. Crane
J.C. Wallis
L. Candela
L. Candela
L.K. Stapleton
M.A. Jaro
P. Nowakowski
P.V. Gorp
R. Shen
R.W. Hamming
S.B. Needleman
T. Blanke
T.F. Smith
V. Levenshtein
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Crossref

Rigid and Non-rigid Shape Matching for Mechanical Components Retrieval

Author: B. Jian
B.K.P. Horn
D. Lowe
D.L. Woodard
D.L. Woodard
E. Lilienblum
G. Zou
G.E. Dieter
H. Bay
H. Chen
H. Takeuchi
J. Kim
J. Nihtila
M. Salzmann
M.R. Millson
P.J. Besl
Q. Li
R. Plänkers
S.B. Needleman
S.C. Park
S.M. Smith
T. Smith
T.S. Caetano
V.A. Mabert
X.F. Zha
Publication venue: Springer
Publication date: 01/01/2012
Field of study

Reducing the setup time for a new production line is critical to the success of a manufacturer within the current competitive and cost-conscious market. To this end, being able to reuse already available machines, toolings and parts is paramount. However, matching a large warehouse of previously engineered parts to a new component to produce, is often more a matter of art and personal expertise rather than predictable science. In order to ease this process we developed a database retrieval approach for mechanical components that is able to deal with both rigid matching and deformable shapes. The intended use for the system is to match parts acquired with a 3D scanning system to a large database of components and to supply a list of results sorted according with a metric that expresses a structural distance. © 2012 IFIP International Federation for Information Processing

Archivio Ricerca Ca'Foscari

Crossref

Archivio istituzionale della ricerca - Università degli Studi di Venezia Ca' Foscari